Effect of Time Derivatives of MFCC Features on HMM Based Speech Recognition System
نویسنده
چکیده
In this paper, improvement of an ASR system for Hindi language, based on Vector quantized MFCC as feature vectors and HMM as classifier, is discussed. MFCC features are usually pre-processed before being used for recognition. One of these pre-processing is to create delta and delta-delta coefficients and append them to MFCC to create feature vector. This paper focuses on all digits in Hindi (Zero to Nine), which is based on isolated word structure. Performance of the system is evaluated by accurate Recognition Rate (RR). The effect of the combination of the Delta MFCC (DMFCC) feature along with the Delta-Delta MFCC (DDMFCC) feature shows approximately 2.5% further improvement in the RR, with no additional computational costs involved. RR of the system for the speakers involved in the training phase is found to give better recognition accuracy than that for the speakers who were not involved in the training phase. Word wise RR is observed to be good in some digits with distinct phones.
منابع مشابه
Integration of articulatory dynamic parameters in HMM/BN based speech recognition system
In this paper, we describe several approaches to integration of the articulatory dynamic parameters along with articulatory position data into a HMM/BN model based automatic speech recognition system. This work is a continuation of our previous study, where we have successfully combined speech acoustic features in form of MFCC with articulatory position observations. Articulatory dynamic parame...
متن کاملSpeech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions
Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...
متن کاملSpeaker Dependent Speaker Recognition Using Svm and Hmm
Speaker recognition is the process of recognizing the speaker based on characteristics such as pitch, tone in the speech wave.Background noise influences the overall efficiency of speaker recognition system and is still considered as one of the most challenging issue in Speaker Recognition System (SRS). Support Vector Machine (SVM) and Hidden Markov Model (HMM) are widely used techniques for sp...
متن کاملOptimization of Features Parameters for HMM Phoneme Recognition of TIMIT Corpus
Phoneme is the smallest contrastive unit in the sound system of a language. Moreover, it has a meaningful role in speech recognition. In this study, we are interesting for phonemes recognition of Timit database using HTK toolkit for HMM. The main goal is to determine the optimal parameters for the recognizer. For this reason, different speech analysis techniques were operated such as Mel Freque...
متن کاملComparison and combination of RASTA-PLP and FF features in a hybrid HMM/MLP speech recognition system
Recently, the advantages of the spectral parameters obtained by frequency filtering (FF) of the logarithmic filter bank energies (logFBEs) have been reported. These parameters, which are frequency derivatives of the logFBEs, lie in the frequency domain, and have shown good recognition performance with respect to the conventional mel-frequency cepstral coefficients (MFCC) for HMM systems. In thi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013